On the consistency of information lters for

نویسنده

  • Chris Mellish
چکیده

A common practice when ltering a case-base is to employ a ltering scheme that decides which cases to delete, as well as how many cases to delete, such that the storage requirements are minimized and the classiication competence is preserved or improved. We introduce an algorithm that rivals the most successful existing algorithm in the average case when ltering 30 classiication problems. Neither algorithm consistently outperforms the other, with each performing well on diierent problems. We also discuss recent work which indicates that ltering can be harmful in some domains, and we conclude that when class deenitions are not made up of homogeneous regions a non-resolvable ambiguity arises between noise and genuine class exceptions. Consistency over many domains, we argue, is very hard to achieve when deploying a ltering algorithm.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Countable Lters on !

Two countable lters on ! are incompatible if they have no common innnite pseudo-intersection. Letting a(P f) denote the minimal size of a maximal uncountable family of pairwise incompatible countable lters on !, we prove the consistency of t < a(P f).

متن کامل

On the Consistency Strength of Irregular Ultra lters

The proof of the consistency of the existence of irregular ultra lters requires large cardinals. Foreman, Magidor, and Shelah forced the existence of a fully irregular ultra lter on a successor cardinal from a huge cardinal. That some large cardinal strength is necessary was shown by Ketonen (0) and Donder, Jensen, and Koppelberg (a measurable cardinal). We considerably improve this lower bound...

متن کامل

بررسی میزان تطابق زبان نمایه‌سازان، نویسندگان و برچسب‌گذاران در پایگاه اطلاعاتی اریک و مندلی

Objective: The purpose of this study was to identify the language consistency between indexers, authors and taggers in the ERIC and Mendeley databases. Methodology: This survey was conducted using content analysis methods and techniques to evaluate the language consistency between indexers, authors and taggers in the ERIC and Mendeley databases and also to determine common keywords. The sample ...

متن کامل

On Hyperideal Structure of Ternary Semihypergroups

In this paper, we introduce and study the concepts of prime left, semiprime left and irreducible left hyperideals in ternary semihyper- groups and investigate some basic properties of them. We introduce the concepts of hyper lter and hypersemilattice congruence of ternary semi- hypergroups. We give some characterizations of hyper lters in ternary semihypergroups. Some relationships between hype...

متن کامل

A Symbiotic Approach to Arc and Path Consistency Checking

The resolution of constraint satisfaction problems heavily relies on the use of local consistency enforcement procedures which are used to lter the problems before or during their resolution. While procedures based on arc-consistency are almost a standard, path-consistency checking is often neglected because it is costly and it lters out pairs of assignments instead of single assignments. This ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1949